HR-WSD: System Description for All-Words Word Sense Disambiguation on a Specific Domain at SemEval-2010
نویسنده
چکیده
The document describes the knowledgebased Domain-WSD system using heuristic rules (knowledge-base). This HRWSD system delivered the best performance (55.9%) among all Chinese systems in SemEval-2010 Task 17: All-words WSD on a specific domain.
منابع مشابه
TreeMatch: A Fully Unsupervised WSD System Using Dependency Knowledge on a Specific Domain
Word sense disambiguation (WSD) is one of the main challenges in Computational Linguistics. TreeMatch is a WSD system originally developed using data from SemEval 2007 Task 7 (Coarse-grained English Allwords Task) that has been adapted for use in SemEval 2010 Task 17 (All-words Word Sense Disambiguation on a Specific Domain). The system is based on a fully unsupervised method using dependency k...
متن کاملRACAI: Unsupervised WSD Experiments @ SemEval-2, Task 17
This paper documents the participation of the Research Institute for Artificial Intelligence of the Romanian Academy (RACAI) to the Task 17 – All-words Word Sense Disambiguation on a Specific Domain, of the SemEval-2 competition. We describe three unsupervised WSD systems that make extensive use of the Princeton WordNet (WN) structure and WordNet Domains in order to perform the disambiguation. ...
متن کاملSemEval-2010 Task 17: All-Words Word Sense Disambiguation on a Specific Domain
Domain portability and adaptation of NLP components and Word Sense Disambiguation systems present new challenges. The difficulties found by supervised systems to adapt might change the way we assess the strengths and weaknesses of supervised and knowledgebased WSD systems. Unfortunately, all existing evaluation datasets for specific domains are lexical-sample corpora. With this paper we want to...
متن کاملHIT-CIR: An Unsupervised WSD System Based on Domain Most Frequent Sense Estimation
This paper presents an unsupervised system for all-word domain specific word sense disambiguation task. This system tags target word with the most frequent sense which is estimated using a thesaurus and the word distribution information in the domain. The thesaurus is automatically constructed from bilingual parallel corpus using paraphrase technique. The recall of this system is 43.5% on SemEv...
متن کاملCFILT: Resource Conscious Approaches for All-Words Domain Specific WSD
We describe two approaches for All-words Word Sense Disambiguation on a Specific Domain. The first approach is a knowledge based approach which extracts domain-specific largest connected components from the Wordnet graph by exploiting the semantic relations between all candidate synsets appearing in a domainspecific untagged corpus. Given a test word, disambiguation is performed by considering ...
متن کامل